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(54)TitIe: METHOD FOR SCREENING FULL-LENGTH cDNA CLONES 



(57) Abstract 

A method for efficiently screening full-length cDNA clones which comprises: determining the base sequence in the S'-region of each 
clone contained in a cDNA library prepared by a method for constructing a cDNA library involving full-length ones at a high ratio; 
examining the presence/absence of initiation ATG in this 5-region and the location thereof by using an originally developed software for 
anticipating initiation codons in cDNA; thus exactly judging the presence/absence of the initiation codon and the location thereof; and 
screening the cDNAs thus judged as carrying the initiation codon from the cDNA library. Moreover, a cDNA library containing full-length 
ones at an extremely high ratio can be constructed by mixing the clones thus selected above. 
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mmm 

^Mc DNA^ D — ><DMlRJ5i£. 

^±s©cdna^ p - > sat* J; < c h ^ii^^ ^ * y £ ft 2> o 

:£l£#*n£>ftTt>&o IP*.. mRNA©CAP£RNA'J — £6^lft££*£-r £^ 'J rf 

^^(eu • *luj> m&n mm mm, 38, 476 -48i (1993)., ^ • t 

m&M 41, 603-607 (1996).. M. Maruyama and S. Sugano, Gene, 

138, 171-174 (1994).), * 'J zS v 7&t0k<iyamz-ierg&*mfr'&t> : &xm 
^Lfc&^^-U 3**-^ y ^S(S. Kato et al., Gene, 150, 243-250 (1994).. JP 

& • mm ^mm-isms^&m. 1994^6/333), capicidnad >#-£<t3M£i§rr 

5'J>*-ftfig^(N. Merenkova and D. M. Edwards, W0 96/34981 Nov. 7, 
1996.), Ctf(Dt?:*r>mmt&%*wzrit¥1lg&m?. Carninci et al., G 
enomics, 37,327-336 (1996).. P. Carninci et al., DNA Research, 4, 61-66 

(i997).)-e£>3o znbi±?^Tn&£W<Dmmk(DCAP*mm-rz>J5m-c 

^M©m^cDNA^^ ZT^ V-Zttm-tZ>J5&L-C&Z>o £fc, Cap^h^>y7-r 



WO 99/20750 



PCT/JP98/04772 



2 

Cap+M hCDUffitetLT, Bm£tzmeim!&C>Ca.p1&'£i9>^Pn*m^Zl5 
i£ (I.Edery et al. ,MCB, 15,3363-3371(1995)) &£#*n£>ftTl^3o ZtKbUft 
LT^ ^-|icDNA(D5 , ^*C-7 1 ^ 'J>^LTCap Switch oligonucleo 
tide^rT' — — ;i/^-y-SCap Switch oligonucleotide$£T-fe£Cap Finder (Clonte 

Sri^WT-^D — Wtf 6fe#>C N cDNA-7 -i "7^y 'J-l^lti^g^D-Vi)^ 

iiitu, cdna^ 4 *)-frb®mm^±&cMk*mn-rzzL£&-i:g2>b 

U-fcffl^tifcJU S5S!i*T-^ScDHA*aiRL-5Si:%ifco !P*>s ^fi*® 

^cDNA^ 77 U -ftf£i£T-cDNA7 -f V 5'*J««t 

©DNA JMIS?'J£&£U dCDtfi«rtt:H|RHJ&=i K >#&&f Z *»SA»*»«f 
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UTteu r A. G. Pedersen, Proceedings of fifth international c 
onference on intelligent systems for molecular biology, p226-233 (1997), 
held in Halkidiki, Greece, June 21-26, 1997. j „ x^v>5: 

"FWTZrvr^&lZ^T&mftZtiTZXte^Zlfi ( TGene Finderj V. V. 
Solovyev et al., Nucleic Acids Res., 22, 5156-5163 (1994).. rQrailj Y. 
Xu et al., Genet-Eng-N-Y., 16, 241-253 (1994)) . C®7o^7A^^4 

* LTn ^^rft^ODSS^cDNAv U -ftf&mT'ftm UfecDNA^ -f ^ U-tC 
^£ft£^D->©5'iM©i&Sia?y£&£U d<Z)PJ3gL£V7 b£*iJffll>T 

JHWSfcH:, £t\ 7i- 'J 3 dr-r >y 7*^-r±ScDNA^ ^ U-^ftMb, cDNA 
WE 5' *MJ©&££ ;h,/zi£M?iJ t o MIRI^p K <fc tf^© 

^■ri)i:*ij^^nfe^o->^cDNA7^y^ u-^?>iiiR-rn«> mmmz% 



WO 99/20750 PCT/JP98/04772 



ip*^ *mm±. cdna^ >j-*^^±scDNA^P->^M4R-rs^^ 

^i^'^ntct !931tRLfccDNA^D->>g:^1-?.^i:$:^i:-r^^±ftcDNA 

(1) ^ficDNA^P->^#g|-r^^?£-efeoT. 

(a) cDNA^-Y 7^ <J -tC^|[ft£cDNA^ d->©5' £i«ia?iJ£&£-r 

(b) (a) T-&5££ftfci£SiB5*J*t3^#}i3 K>##£*lT^£frg:fr£, 

(c) (b) T"^=J \<>ft^l£nx^2>tmfe£titz?u-->*mtRT2>JLms 

(2) cDNA^-fT^ U-#^±:MCDi§^cDNA7^ 7^ U -fPt£?£K £ t) <1f 

(3) cDNA^:/7 >J-*5mRNA©CAP*f^g5-r2.xa^^tf^&tCj; 

(4) ^gcDNA^^ 7v U-^^^-T^^^T-fe^T. 

(a) cDNA^-T 7*7 U-t^^n-ScDNA^P-X^S'^^^JSSlS^J^^^T 

(b) (a) T-^^n/i^Sia^J4>(3^P h*>#^3;*i-<:^£frSfr£. ^ 

(c) (b) xm^ \t>fi^%nx^2,tm%.£titz?n->zmiR?z^m, 

(d) (c) ■CHR^tifc^D-V^l^tSlS, 

(5) cDNA^'f 77 <J-*s^ft*©S^cDNA7^^7 U-ftM&tCi Dft 

ti£tifc&©T-&3, (4) (ci2«©^^ 

(6) cDNAt^ 77 U-^mRNAOCAP^^Ifp-r^Xli^^tr^tcfc t>{t§£ 
£ftfc£Gr>T-&^ (4) ^H3«g07j&, 
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(7) (4) izmmcDJsmz&QttmLozcmh^jT-?*)-^ 

*?&W\±. cDNAv 4 /7 'J — , ifrlZ^McDBV^DNA^ -i 7v "J -©5' Slit 

<fct>, ?J]^^(w^:ScDNA^o->^#^|-r^>Ci:#-r-§, £ £ £#111 bfccDNA^ 
n->^S^-r^)Ci:tc < tt)^S* : &^^t:ffi«)fecDNA^^ 'J-=&#lt 

NA^P->SMiRTi)^^li. (a) cDNA^ 7 =7 U -tC-^tiScDNA^ 
©5'*5ifr^i&Sifi?iJ£&;£-r3X*§. (b) ^^ftfei^IBSW;:^^ K 

xg N (c) gate 3 ^*>* s ^^tL^:v^shf^J^^tLfe^D->=£:MiR■r^ 

X*M£^tf 0 *^B^©^±McDNA^^ 77 U -cDtf^&ti, £ 5> lz (d) 

£ LT ^^mcDNA©#^t©^^#{ST T^>i:#X.^tl^o 3 © £ #>cDNA^7 D — > 
t±, ±IB©^±WO^V>cDNA^^ 7^ ■J -Of^fig^ 0>J;ii;£. mRNACDCAP tzRNA 

>j >*-^^m^t^-r^.^-u=i3r^>yr?£(es • 3tLiu, m&n mm. mm, 

38, 476 -481 (1993)., • mm, m&n tm. mm, 41, 603-607 (1996)., 
M. Maruyama and S. Sugano, Gene, 138, 171-174 (1994).). t^^f^S 
^0kayama-Berg^^^^rt>i±Tgi^bfe^^-Ui3^r^^7'^(S. Kato et a 
1., Gene, 150, 243-250 (1994).. MM • H« #P8¥6-153953 1994^6E3B). 
CAPKDNA'J >*-%<fb^*g^-ra 'J >*— {b3MS^i*(N. Merenkova and D. M. 
Edwards, WO 96/34981 Nov. 7, 1996.). CAP© tT^^gptC <£5*-V s»7ft* 
f^§S&(P. Carninci et al., Genomics, 37,327-336 (1996)., P. Carninci et 
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al., DNA Research, 4, 61-66 (1997).), iSt fcttHelafflWS©Cap«S^^ >/V7 
H£ffl^£>£& (I.Edery et al.,MCB, 15, 3363-3371(1995)) . Cap Switch olig 
onucleotidei£T*fc£Cap FinderfclJ; 0 itm^ntz^ 4 7^ V —tZte&TZ C ilifi 

cDNAv-C 7^ U— *»e>©cDNA*D — >©l£«t± N zfcjft (J.Sambrook.E.F.Frits 
ch & T.Maniatis, Molecular Cloning Second edition, Cold Spring Harbor Labo 
ratory Press 1989) fc£fc:3Ett©fi*ifeK«fc D ^5 d So 

A X*± © DNA > - >r > X U. m £ DNA > - >r > It - H £ ffl l> X * i£ £ «fc »j fr -5 - i: # 

^•Ttltf-t-^-Cfe*^ 500i£S*IJg> $c,t{±300ffiSegE©JfiSiByj©*^-C 

X 7> ^ £ ^ L U IMSiE^J 4> £ =3 K > #??&-r Z> £ titz 
cDNA^d — >fc* N iim&s ^^cDNAT-febs XnT'tfteV^fc^Ufilfi 
IB^J^CIH^n F>##:£L&v^¥J^£ftfccDNA*D — >« s ^^gcDNAT- 
fc&o ^t, cDNA^ 75 U-fr?> ^SiS^J^tc^n K>^#<£T5 i:$iJ£ 
£ tlfccDNA^ D - > *MlR-f 2> d i: £ £ D . 3M5fcJ t^ScDNA* S H h 
#aJt£T-&3o H^tHJS^J 1 tiB3tt©7 p a^-7A4ffl^fcjB*f©-0!l"ett. ^ 
^S^*s5U©cDNA7-f 77 »J-£* o->©3B#?fczffl^fc«^ 
0.94) £0.5W±-e*n->£3iiR-r£i:£D — >©^±Wtt71X, *pt£0. 
701^±T* ? P — > £ MiRT 5h^D-> ©to£S*8:77JU ^^7^0. 80I^±T* *7 
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A£^T*3T©SV^P->£iiiR-r3£ tK E5^flt4ST?£5££cDNA 

mmm 1 3 cdnacdhsrm^^ h > § ro ^7 a©«mki 

£M]T£o ^$J&, g^fttcfct, A) *BLfcATG©iiiofl!l©-£»5fflfK*S (tSUO 

-blooms) (D*ti?ti(z&ttzmmmw,e> u£©i§$&h, b) *BL£ATG©tt 
^imiTi5<o rn^Att, zm^©^ti$B*ffl^T!Sf5ii$fe=i k> 

A3i^7 V>^?|j7*D^*^ ATrfc^Gene Finder (Solovyev V.V., Salamov 
A. A. ,Lawrence C.B. Predicting internal exons by oligonucleotide composit 
ion and discriminant analysis of spliceable open reading frames. Nucleic. 
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Acids Res. (1994) 22: 5156-63.) *T-^£ftTU£i&ff^JS'J##r£, ^Jfi'J© 

-C^,^^7 : -^-*fflv^fc^cDiE^^5l^h•r^) t ^811" So -^-x^ 
tifccDNAiB5tIBR)=t-t:^*tiS«-ATGt:J*L-CfflaRli*&3 K > L£©flt*#ffi*i£ 

OMoy-fX^gUTilff^lTOiPCfSlM, ^£©{il£IS;£-f €>„ WiK, 
40%© y W X£E^Rlt6T-$>fti£, U§lM*0.6*£&ffl-r*Uj:«fc^o S*>©;^* 

±&(Dkt mmm £ j%<*<b l t * n**i3o©««£ m v ^ fc . 

MfltfJCfcJU A) *g LfcATG©t^ffly©-^laiaM^ (»10-~»100JHa£) ©^ 
(©*T*, ATG©300i&»££-£) {C#^nS6HS^©«)gt»^ (DATG©tu^ 

50JMSt;:^£ft&6iM&:fc^©$PiJ£tit*B©M. h*t> L£©*i*i 

(ATG©&30T^ /g? (90iM) ©4>"C* , g^i^7K^^8r < y ^^©iBfek^Ji 
«) £ffl^fco B) *SLfcATG©f*j£©fflaRPa*&3 K>?>L$©«*i:L 

Tiis ®ATG©14«SHlf*»?>5mS»*7?©«S«6l*I©3fiSS:^4*{fti: LfcS2> 
tT^'Jtf^ (2)ATG©Hu(c|^Il:7U-AT*fl!l©ATGASfeS^^VAA» (fcftlik 

7P^A (ATGpr) IzXZMtfr 
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eratocarcinoma NT-2. neuroblastoma SK-N-MC) i !9 Xffll (J.Sambrook,E.F.Fri 
tsch & T.Maniatis, Molecular Cloning Second edition, Cold Spring Harbor La 
boratory Press 1989) tiffiOfimz £ !3mRNA£ttJ£S bfco &<^tr. t'J^^y 
7*9 (IE3*J#^ : 1) ^UrfdTTyT*^ — r^-f -r- (IB?iJ#-«§ : 

2) (^li:2<D^) . |fcli7>^A7m-77^7- (IS?'J#-*§ : 

3) (^3i:i4o^) ^ffl^M • ti, gam mm. mm, 41, 

603-607 (1996).p606> Y.Suzuki et al. , Gene, 200, 149-156(1997)) <DH3*E£fi£V\ 
BAP&LIIL TAPJM, *>3 IP§-iiicDNA<D^I^RNA<D|&*£?To fz a 

ftl\Ts PCRCCJ; t)2^$IcDNA(C^L. Sf ilTO b £ „ *UT\ Dralllt-^BJrb 
£-<**-pME18SCG, pMFL^ (^S ■ 3tllU, I6S g#m, 38, 472-481 (1 
993).p480) (;cDNA©^[p]t4*^^T^O-^>^Ufco - tl lZ <fc »3 ff fcDNA(Z-D 
l^TDNAS/ — >T >^>>^7*IS^(DyeTerminator Cycle Sequencing FS Ready Reacti 
on Kit, PE Applied Biosys terns )£ffl V^T^-^. 7)HZ bfc#o X >T > i/ > 
DNAv— ^ >it— (ABIPRISM 377, PE AppliedBiosystems)T-DNAJMSIS 
5'J£l$?#Tb£o ^n->®5'*J;^ ^iiftil ©<D^DNA|E?'J£/§?#t b£o 
£-^P->©DNAIB?iJtCO^T, b fccDNACiBHRH&n. K^ilirD^A 
(ATGpr) T-^HRFJ^n b'>^#^TS* 1 g^{3 0^|fWUfco C©iff7D^7 

M{i0.94-?Ife^o 

(1) ^-'Jl3^^ •yr&T-cDNA^f^^bfccDNACDa^, ft? 7"> 'J — 7 s 

#r 

ttfiyZ-t^—XX'&tiKDpU-^ (F-NT2RP 1000020. F-NT2RP 1000025. F-NT2R 
P1000039. F-NT2RP 1000046) iZ-o^X <DM^^ WTtit 1 ) o F-NT2R 
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P1000020(880 bp)t±x 88-690tt# i"Human neuron-specific gamma-2 eholasej 
(GenBank accession No.M22349)£*JL4@|bH4#96%, F-NT2RP 1000025 (645 bp)fcfc. 
29-64H£# r Human alpha- tubulin mRNAj (GenBank accession No.K00558) Izfcf 
{smm&WrL F-NT2RP1000039(820 bp)&, 12-820{i# rnuman mRNA for elon 
gation factor 1 alpha subunit(EF-l alpha)j (GenBank accession No. X0355 
8)tC*fb;|§|BH£#96L F-NT2RP 1000046 (788 bp)&3-788{£# rfluman M2-type py 
ruvate kinase mRNAj (GenBank accession No.M23725)£*l V^m^t^97%-^$>^ 0 

7 IZmTo 

^1 



F-NT2RP 1000020 F-NT2RP1000025 F-NT2RP1000039 F-NT2RP 100 0046 



ATG 


ATG© 


ATGpr 


ATG© 


ATGpr 


ATG© 


ATGpr 


ATG© 


ATGpr 


No. 








737 


&m 








1 


1 


0.05 


96 


<0.94> 


65 


<0.90> 


111 


<0.94> 


2 


162 


<0.84> 


148 


0.13 


154 


0.05 


174 


0.82 


3 


292 


0.05 


193 


0.05 


209 


0.11 


198 


0.19 


4 


313 


0.05 


201 


0.09 


231 


0.05 


300 


0.16 


5 


441 


0.05 


232 


0.05 


321 


0.05 


315 


0.11 



mi) < > : mm$t&^ K> 

(&2) ATG©{£B : 5' t) ©DNAIB?!]-t'©ATG©#:£ f £J»No. fc^fc 
ATG No. : 5' ») ©DNAIB?iJT-©ATG©No. fc^-To 
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S©^D — >toi>TiafSLfccDNA©ffliRiii&3 h*>fi7 , D^5A (ATGpr) T- 

fiH^n K>t-ffcbfc) o 

( 2 ) * U * T&T*cDNA£<1^bfccDNA© o *>s l^HiiSn h* >#i^^E 

b&^ci hi?-? — ^ — ^ — x~c%tft\(Dt7 v—y^-DK^xcommr^u k >©$mjt 

h^r-i'-^-^t-gEtt©^ o-> (F-NT2RP1000013. F-NT2RP1000054, 
F-NT2RP1000122) (30lxT©8?#f*£m£ WT £ ^"f (312) o F-NT2RP1000013(6 
08 bp){±l-606{ii[# r Human nuclear matrix protein 55 (nmt55) mRNAj (GenBan 
k accession No. U89867)(Z*j- Lmm&&97%. F-NT2RP1000054(869 bp)(il-869£L 
1fi r Human signal recognition particle (SRP54) mRNAj (GenBank accession N 
o. U51920){'^U^IpH4^96% n F-NT2RP1000122(813 bp)tel-813#:# Hi.sapien 
s mRNA for 2-5A binding proteinj (GenBank accession No.X76388)lC^bffil5] 

tt«tfc^fco &*>\ ^n^^D-xD5'^{pij©^i£^j*^n^niB^j# 

^ : 8, 9. 10 IZfjkiTc 
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m2 



F-NT2RP1000013 F-NT2RP1000054 F-NT2RP1000122 



ATG 


ATG© 


ATGpr 


ATG© 


ATGpr 


ATG© 


ATGpr 


No. 


&m 








teg 




1 


21 


0.05 


31 


0.12 


23 


0.07 


2 


27 


0.05 


60 


0.20 


100 


0.05 


3 


32 


0.32 


87 


0.05 


166 


0.05 


4 


56 


0.11 


97 


0.05 


235 


0.06 


5 


119 


0.10 


146 


0.05 


316 


0.05 


6 


125 


0.08 


172 


0.05 


346 


0.05 


7 


141 


0.05 


180 


0.11 


406 


0.05 


8 


155 


0.06 


218 


0.07 


431 


0.05 


9 


161 


0.06 


272 


0.05 


469 


0.06 


10 


176 


0.08 


319 


0.07 


546 


0.12 


11 


203 


0.07 


346 


0.05 


553 


0.05 


12 


290 


0.20 


363 


0.07 


574 


0.05 


13 


311 


0.16 


409 


0.05 






14 


314 


0.12 


480 


0.07 







m. 2 #7jVf«fc o U =f*^ y !7i£T'cDNA£flsfi)tLfccDNA©-5 *>s fCt*— 
^±ft©^o->{zo^T^^LfecDNA©^iR^^=r K>^«iJ7*d^7A (ATGp 
(3) ^'Jzf^r^^r^T'cDNA^^figbfccDNA©^*), £rSIIE?'J©^ D->f3 
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P-> (F-ZRV6C1000408, F-ZRV6C1000454, F-ZRV6C1 000466, F-ZRV6C1000615, 
F-ZRV6C1000670) to^TO^^^WTC^t (^3) 0 ^ti^^D 
->©5'^(PJj(D^Si2^J=&^ti^m2?'J#^ : 1 K 12, 13, 14, 15 
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S3 



F-ZRV6C 1000408 F-ZRV6C1000454 F-ZRV6C1000466 



ATG 


ATG© ATGpr 


ATG© ATGpr 


ATG© ATGpr 


No. 














1 


85 


<0.94> 


5 


0.05 


162 


<0.86> 


2 


208 


0.22 


107 


<0.87> 


182 


0.05 


3 


386 


0.05 


153 


0.05 


207 


0.08 


4 


518 


0.11 


201 


0.08 


244 


0.05 


5 


545 


0.05 


211 


0.05 


262 


0.05 


6 






236 


0.07 


303 


0.11 



F-ZRV6C1000615 F-ZRV6C1000670 



ATG 


ATG© ATGpr 


ATG© ATGpr 


No. 




737 






1 


85 


<0.94> 


120 


<0.94> 


2 


208 


0.26 


187 


0.54 


3 


386 


0.05 


312 


0.06 


4 


518 


0.09 


388 


0.05 


5 


545 


0.05 


445 


0.05 



«3^t<fco £F-ZRV6C1000408 N F-ZRV6C1000454, F-ZRV6C1000466, F-ZRV6 
C1000615. F-ZRV6C1000670(wOVm:{±. ^ft^ft85tt. 107£u 162tiU 85{iu 12 
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O&CDJ&S r Aj fr<o#i£3 r ATGj tfiMte? K>-C-fe5h*J^$tlfco ZOtz&Z 
ft P, * □ — >{*^±ScDNA^ D ->T- fo £ i:*!j®T Lfco 

fcSr«*n-> (F-ZRV6C1001410, F-ZRV6C1001197. F-ZRV6C1001472) fco^T 
£^ft^ftia?'J#-*§ : 1 6. 17, 1 8(^to 

*4 



F-ZRV6C1001410 F-ZRV6C1001197 F-ZRV6C1001472 



ATG 


ATG© 


ATGpr 


ATGCD 


ATGpr 


ATGCD 


ATGpr 


No. 






teg 






XZ3T 


1 


23 


0.05 


5 


0.24 


77 


0.25 


2 


31 


0.07 


141 


0.25 


126 


0.05 


3 


71 


0.06 


202 


0.05 


149 


0.05 


4 


178 


0.05 


219 


0.05 


194 


0.05 


5 


214 


0.05 


228 


0.05 


213 


0.22 


6 










249 


0.05 


7 










338 


0.09 


8 










344 


0.05 


9 










351 


0.05 


10 










365 


0.05 



31 4 #^-r<k^fc:F-ZRV6C 1001410. F-ZRV6C1001197. F-ZRV6C1001472(3Ol,>T 
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(a) cDNA^l' U-t^^n^cDNA^D->©5'^^^JSSIS?'J^^t- 

(b) (a) T-^^tifciMSIE^lz^n K>#^£*iT^£*pgfr£, m 

(c) (b) -egwsn i«>*«$*nT^4J:*d^#ttfc^D— >4«iR-r*xe^ 

2 . cDNA^ -tf^feMOJS^cDNA^ -< 7 5 U -fl^SSst: J: D 

3 . cDNA^-f 7-7 U-^mRNAcDCAP^I^fpr^xm^atr^StZj; t>fls»3*lfc 

4. ^^Scdna-7-t y-sf^st suffix- ftot, 

(a) cDNA^ 7^ y-C^*ixScDNA^D — >6D5'*«S3&»e>ffiSK5>J%?*^r 

(b) (a) -C&5££tlfcJg£ffi?!J*£ia*S:3 K >#^£ftT^3a»gfr£> Bl 

(c) (b) XH>&=J K>^*^^TV^5h*^J^$nfc^D->*M4R■rsx^ N 

(d) (c) -zmnztitz?u->zm&T2>z-m, 

5 . cDNA^ 4 7? L) — tf^^g^scDlftV^cDNA^ 75> U -f^M&lC «fc b 

6 . cDNA^ 77 U-#MRKACDCAPS«ffif|i-r«Xg*^tr5Er&t:J: Df^K^tlfc 
7. lt^4t:IB«©^t:J:»)f^»b-5acDNA^^^ U- 0 
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mm 

SEQUENCE LISTING 
<110> Helix Research Institute, Inc. 

<120> The method for selecting full length cDNA clones 

ft±& cDNA^D-> <DmtRJ3& 

<130> H1-806PCT 

<150> JP 09-289982 
<151> 1997-10-22 

<160> 18 

<170> Patentln version 2.0 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo-capping linker sequence 
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<400> 1 

AGCAUCGAGU CGGCCUUGUU GGCCUACUGG 30 

<210> 2 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo(dT) adapter primer sequence 
<400> 2 

GCGGCTGAAG ACGGCCTATG TGGCCTTTTT TTTTTTTTTT TT 42 

<210> 3 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Random adapter primer sequence 



<400> 3 

GCGGCTGAAG ACGGCCTATG TGGCCNNNNN NC 



32 
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<210> 4 

<211> 880 
<212> DNA 

<213> Homo sapiens 
<400> 4 

ATGCGCCCGC GCGGCCCTAT AGGCGCCTCC TCCGCCCGCC GCCCGGGAGC CGCAGCCGCC 60 

GCCGCCACTG CCACTCCCGC TCTCTCAGCG CCGCCGTCGC CACCGCCACC GCCACTGCCA 120 

CTACCACCGT CTGAGTCTGC AGTCCCGAGA TCCCAGCCAT CATGTCCATA GAGAAGATCT 180 

GGGCCCGGGA GATCCTGGAC TCCCGCGGGA ACCCCACAGT GGAGGTGGAT CTCTATACTG 240 

CCAAAGGTCC TTTCCGGGCT GCAGTGCCCA GTGGAGCCTC TACGGGCATC TATGAGGCCC 300 

TGGAGCTGAG GGATGGAGAC AAACAGCGTT ACTTAGGCAA AGGTGTCCTG AAGGCAGTGG 360 

ACCACATCAA CTCCACCATC GCGCCAGCCC TCATCAGCTC AGGTCTCTCT GTGGTGGAGC 420 

AAGAGAAACT GGACAACCTG ATGCTGGAGT TGGATGGGAC TGAGAACAAA TCCAAGTTTG 480 

GGGCCAATCC ATCCTGGGTG TGTCTCTGGC CGTGTGTAAG GCANGGGCAA CTGAACNGGA 540 

ACTGCCCCTG TATCGCCACA TTGCTCAGCT TGGNCGGGAA CTCANACCTC ATCCTGCCTG 600 

TTGCCGGCCT TCAACGTGAT CAATGGTTGG CTTCTCATGC CTGGCAACAA ANCTGGCCAT 660 

TGCNGGAATT TTCATGATCC TCCCCNTTGG GAAACTGAAA AACTTTCCGG AATGCCCNTC 720 

CAACTAAGTT GCAAAAGGTC TACCNATACC CCCCAAGGGG AATTCCTCCA AGGGAACAAA 780 

TNCCCGGGAA AGGAATGCCC CCCAATTNTT NGGGGGAATA AAAGGTGGGC TTTGCCCCCC 840 

CATTTTCCTG GAAAAAACNA TNAAAACCCT TGGGAAACTT 880 

<210> 5 



<211> 645 
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<212> DNA 

<213> Homo sapiens 

<400> 5 

TGTGCGTTAC TTACCTCNAC TCTTAGCTTG TCGGGGACGG TAACCGGGAC CCGGTGTCTG 60 

CTCCTGTCGC CTTCGCCTCC TAATCCCTAG CCACTATGCG TGAGTGCATC TCCATCCACG 120 

TTGGCCAGGC TGGTGTCCAN ATTGGCAATG CCTGCTGGGA GCTCTACTGC CTGGAACACG 180 

GCATCCAGCC CGATGGCCAG ATGCCAAGTG ACAAGACCAT TGGGGGAGGA GATGACTCCT 240 

TCAACACCTT CTTCAGTGAG ACGGGCGCTG GCAANCACGT GCCCCGGGCT GTGTTTGTAG 300 

ACTTGGAACC CACAGTCATT GATGAAGTTC GCACTGGCAC CTACCGCCAG CTCTTCCACC 360 

CTGAGCAGCT CATCNCAGGC AAGGAAGATG CTGCCAATAA CTATGCCCGA GGGCACTACA 420 

CCATTGGCAA GGAGATCATT GACCTTGTGT TGGACCGAAT TCGCAAGCTG GCTGACCANT 480 

GCACCGGTCT TCANGGCTTC TTGGTTTTCC ACAGCTTTGG TGGGGGAACT GGTTCTGGGT 540 

TCACCTCCCT GCTCATGGAA CGTCTCTCAG TTGATTATGG CAAGAAATCC AAGCTGGAGT 600 

TCTCCATTTA CCCAGCACCC CNGGTTTCCN CNGCTGTANT TNGAA 645 

<210> 6 

<211> 820 
<212> DNA 
<213> Homo sapiens 

<400> 6 

CTTTTTTCGC AACGGGTTTG CCGCCAGAAC ACAGGTGTCG TGAAAACTAC CCCTAAAAGC 60 

CAAAATGGGA AAGGAAAAGA CTCATATCAA CATTGTCGTC ATTGGACACG TAGATTCGGG 120 

CAAGTCCACC ACTACTGGCC ATCTGATCTA TAAATGCGGT GGCATCGACA AAAGAACCAT 180 
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TGAAAAATTT GAGAAGGAGG CTGCTGAGAT GGGAAAGGGC TCCTTCAAGT ATGCCTGGGT 240 

CTTGGATAAA CTGAAAGCTG AGCGTGAACG TGGTATCACC ATTGATATCT CCTTGTGGAA 300 

ATTTGAGACC AGCAAGTACT ATGTGACTAT CATTGATGCC CCAGGACACA GAGACTTTAT 360 

CAAAAACATG ATTACAGGGA CATCTCAGGC TGACTGTGCT GTCCTGATTG TTGCTGCTGG 420 

TGTTGGTGAA TTTGAAGCTG GTATCTCCAA GAATGGGCAG ACCCGAGAGC ATGCCCTTCT 480 

GGCTTACACA CTGGGTGTGA AACAACTAAT TGTCGGTGTT AACAAAATGG ATTCACTGAN 540 

CCACCCTACA GCCAGAAGAA ATATGANGAA ATTGTTAAGG AAGTCAGCAC TTACATTAAG 600 

AAAATTGGCT ACAACCCCGA CACAGTANCA TTTGTGCCAA TTTCTGGTTG GAATGGTGAC 660 

AACATGCTGG AACCAANTGC TAACATGCCT TGGTTCCAGG GATGGAAAAT CCCCCNTTAA 720 

GGATGGCNAT GCCATTGGAA CCCCCCTGCT TGAAGGCTCT GGANTGCATC CTANCACCAA 780 

CTCCTTCAAA TTGAAAAACC CCTTGCNCCC GCCTCCNCCA 840 

<210> 7 

<211> 788 
<212> DNA 
<213> Homo sapiens 

<400> 7 

GAGGCTGAGG CAGTGGCTCC TTGCACAGCA GCTGCACGCG CCGTGGCTCC GGATCTCTTC 60 

GTCTTTGCAG CGTAGCCCGA GTCGGTCAGC GCCGGAGGAC CTCAGCAGCC ATGTCGAAGC 120 

CCCATAGTGA AGCCGGGACT GCCTTCATTC AGACCCAGCA GCTGCACGCA GCCATGGCTG 180 
ACACATTCCT GGAGCACATG TGCCGCCTGG ACATTGATTC ACCACCCATC ACAGCCCGGA 240 
ACACTGGCAT CATCTGTACC ATTGGCCCAG CTTCCCGATC AGTGGAGACG TTGAAGGAGA 300 

TGATTAAGTC TGGAATGAAT GTGGCTCGTC TGAACTTCTC TCATGGAACT CATGAGTACC 360 
ATGCGGAGAC CATCAAGAAT GTGCGCACAG CCACGGAAAG CTTTGCTTCT GACCCCATCC 420 
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TCTACCGGCC CGTTGCTGTG GCTCTAGACA CTAAAGGACC TGAGATCCGA ACTGGGCTCA 480 

TCAAGGGCAG CGGCACTGCA GAGGTGGAGC TGAAGAATGG AGCCACTCTC AAAATCACGC 540 

TGGATAATGC CTACATGGAA AAGTGTGACG AGAACATCCT GTGGCTGGAC TACAAGAACA 600 

TCTGCAAGGT GGTGGAAGTG GGCAACAAGA TCTACGTGGA TGATGGGCTN ATTTCTCTCC 660 

AGGTGAACAC AAAGGTGCCG ACTTCCTGGG TGACNGANGT GGAAAATGGT GGCTCCTTGG 720 

GCNCAAGAAA GGTGTGAACT TCCTGGGGCT GCTGTGGANT TGCCTGCTGT GTCNGAAAAA 780 

GACATCCA 788 

<210> 8 

<211> 608 
<212> DNA 
<213> Homo sapiens 

<400> 8 

ACAGCCTGGC TCCTTTGAGT ATGAATATGC CATGCGCTGG AAGGCACTCA TTGAGATGGA 60 

GAAGCAGCAG CAGGACCAAG TGGACCGCAA CATCNAGGAG GCTCGTGAGA AGCTGGAGAT 120 

GGAGATGGAA GCTGCACGCC ATGAGCACCA GGTCATGCTA ATGAGACAGG ATTTGATGAG 180 

GCGCCAAGAA GAACTTCGGA GGATGGAAGA GCTGCACAAC CAAGANGTGC AAAAACGAAA 240 

GCAACTGGAG CTCAGGCAGG AGGAANAGCG CAGGCGCCGT GAAGAANAGA TGCGGCGGCA 300 

GCAAGAAGAA ATGATGCGGC GACNGCAGGA AGGATTCAAG GGAACCTTCC CTGATGCGAG 360 

AGAGCAGGAG ATTCGGATGG GTCNGATGGC TATGGGAGGT GCTATGGGCA TAAACNACAG 420 

ATGTGCCATG CCCCCTGCTC CTGTGCCAGC TGGTACCCCA GCTCCTCCAG GACCTGCCAC 480 

TATTATGCCG GATGGAACTT TGGGATTGAC CCCACCNACA ACTGAACGCT TTGGTCNGGC 540 

TGCTACNATG GAANGAATTG GGGCAATTGG TGGAACTCCT CCTGCATTCN ACCGTGCAGC 600 

TCCTGGGA 608 
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<210> 9 

<211> 869 
<212> DNA 
<213> Homo sapiens 

<400> 9 

ATATTAAACT AGTGAAGCAA CTAAGAGAAA ATGTTAAGTC TGCTATTGAT CTTGAAGAGA 60 

TGGCATCTGG TCTTAACAAA AGAAAAATGA TTCAGCATGC TGTATTTAAA GAACTTGTGA 120 

AGCTTGTAGA CCCTGGAGTT AAGGCATGGA CACCCACTAA AGGAAAACAA AATGTGATTA 180 

TGTTTGTTGG ATTGCAAGGG AGTGGTAAAA CAACAACATG TTCAAAGCTA GCATATTATT 240 

ACCAGAGGAA AGGTTGGAAG ACCTGTTTAA TATGTGCAGA CACATTCAGA GCAGGGGCTT 300 

TTGACCAACT AAAACAGAAT GCTACCAAAG CAAGAATTCC ATTTTATGGA AGCTATACAG 360 

AAATGGATCC TGTCATCATT GCTTCTGAAG GAGTAGAGAA ATTTAAAAAT GAAAATTTTG 420 

AAATTATTAT TGTTGATACA AGTGGCCGCC ACAAACAAGA AGACTCTTTG TTTGAAGAAA 480 

TGCTTCAAGT TGCTAATGCT ATACAACCTG ATAACATTGT TTATGTGATG GATGCCTCCA 540 

TTGGGCAGGC TTGTGAAGCC CAGGCTAAGG CTTTTAAAGA TAAAGTAGAT GTACCTCAGT 600 

AATAGTGACA AAACTTGATG GCCATGCAAA ANGAAGTGGT GCACTCAGTG CAGTCGCTGC 660 

CACAAAAAAT CCGATTATTT TCATTGGTAC AGGGGGAACA TATANATGAC TTTGAACCTT 720 

TCAAAAACAC AGCCTTTTAT TAACAAACTT CTTGGTATNG GCGACATTGA AAGGACTGAT 780 

AAATAAAGTC CACNAATTGA AATTTGGATG ACNATGNAAA CCCTTATTGA AAAAATTGAA 840 

ACATNGTCCA GTTTTACTTT GCGAAACNT 869 



<210> 10 
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<211> 813 

<212> DNA 

<213> Homo sapiens 
<400> 10 

GTTGTGGTAT CTGTATTAAG AAATGCCCCT TTGGCGCCTT ATCAATTGTC AATCTACCAA 60 

GCAACTTGGA AAAAGAAACC ACACATCGAT ATTGTGCCAA TGCCTTCAAA CTTCACAGGT 120 

TGCCTATCCC TCGTCCAGGT GAAGTTTTGG GATTAGTTGG AACTAATGGT ATTGGAAAGT 180 

CAACTGCTTT AAAAATTTTA GCAGGAAAAC AAAAGCCAAA CCTTGGAAAG TACGATGATC 240 

CTCCTGACTG GCAGGAGATT TTGACTTATT TCCGTGGATC TGAATTACAA AATTACTTTA 300 

CAAAGATTCT AGAAGATGAC CTAAAAGCCA TCATCAAACC TCAATATGTA GACCAGATTC 360 

CTAAGGCTGC AAAGGGGACA GTGGGATCTA TTTTGGACCG AAAAGATGAA ACAAAGACAC 420 

AGGCAATTGT ATGTCAGCAG CTTGATTTAA CCCACCTAAA AGAACGAAAT GTTGAAGATC 480 

TTTCAGGAGG AGAGTTGCAG AGATTTGCTT GTGCTGTCGT TTGCATACAG AAAGCTGATA 540 

TTTTCATGTT TGATGAGCCT TCTAGTTACC TAGATGTCAA GCAGCGTTTA AAGGCTGCTA 600 

TTACTATACG ATCTCTAATA AATCCAGATA GATATATCAT TGTGGTGGAA CATGATCTAA 660 

GTGTATTAGA CTATCTCTCC GACTTCATCT GCTGTTTATA TGGTGTACCA AGCGCCTATG 720 

GAATTGTCAC TATGCCTTTT AGTGTTAGAA AAGGCATAAA CNTTTTTTGG ATGGGTATGT 780 

TCCAACAGAA AACTTGANAA TCNNAAATGC NTC 813 

<210> 11 

<2U> 655 
<212> DNA 
<213> Homo sapiens 
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<400> 11 

A6ACTCTCAC CGCAGCGGCC AGGAACGCCA GCCGTTCACG CGTTCGGTCC TCCTTGGCTG 60 

ACTCACCGCC CTCGCCGCCG CACCATGGAC GCCCCCAGGC AGGTGGTCAA CTTTGGGCCT 120 

GGTCCCGCCA AGCTGCCGCA CTCAGTGTTG TTAGAGATAC AAAAGGAATT ATTAGACTAC 180 

AAAGGAGTTG GCATTAGTGT TCTTGAAATG AGTCACAGGT CATCAGATTT TGCCAAGATT 240 

ATTAACAATA CAGAGAATCT TGTGCGGGAA TTGCTAGCTG TTCCAGACAA CTATAAGGTG 300 

ATTTTTCTGC AAGGAGGTGG GTGCGGCCAG TTCAGTGCTG TCCCCTTAAA CCTCATTGGC 360 

TTGAAAGCAG GAAGGTGTGC GGACTATGTG GTGACAGGAG CTTGGTCAGC TAAGGCCGCA 420 

GAAGAAGCCA AGAAGTTTGG GACTATAAAT ATCGTTCACC CTAAACTTGG GAGTTATACA 480 

AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCANATG CCTCCTACGT GTTTTATTGC 540 

NCAAATGAAA CGGTGCATGG TGTTGANTTT GACTTTATAC CCNATGTCAA GGGAACANTA 600 

CTGGTTTGTG ACATTTTCCT CCAACTTCCT GTCCAANCCA ATTGNATGTT TCCAA 655 

<210> 12 

<211> 599 
<212> DNA 

<213> Homo sapiens 
<400> 12 

AAAGATGCGC AGGCGCCGTG TGGCACTCGG CGGTCGAAAG GGGAGTTCAA GGAGACGGGG 60 

GCGACGCGGC TGAGGGCTTC TCGTCGGGGT CGGGGCTGCA GCCGTCATGC CGGGGATAGT 120 

GGAGCTGCCC ACTCTAGAGG AGCTGAAAGT AGATGAGGTG AAAATTAGTT CTGCTGTGCT 180 

TAAAGCTGCG GCCCATCACT ATGGAGCTCA ATGTGATAAG CCCAACAAGG AATTTATGCT 240 

CTGCCGCTGG GAANAGAAAG ATCCGAGGCG GTGCTTAGAG GAAGGCAAAC TGGTCAACAA 300 

GTGTGCTTTG GACTTCTTTA GGCAGATAAA ACGTCACTGT GCAGAGCCTT TTACAGAATA 360 
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TTGGACTTGC ATTGATTATA CTGGCCAGCA GTTATTTCGT CACTGTCGCA AACAGCAGGC 420 

AAAGTTTGAC NAGTGTGTGC TGGACAAACT GGGCTGGGTG CGGCCTGACC TGGGAAAACT 480 

GTCAAAGGTC ACCAAAGTGA AAACAGATCN ACCTTTACCG GANAATCCCT ATCACTCAAG 540 

AACAAGAACG GATCCCAGCC CTGANATCNA AGGAAATCTG CANCCTGCCA CACATGGCA 599 

<210> 13 

<211> 597 
<212> DNA 

<213> Homo sapiens 
<400> 13 

ATATCCGGAG TAGACGGAGC CGCAGTAGAC GGATCCGCGG CTGCACCAAA CACTGCCCCT 60 

CGGAGCCTGG TAGTGGGCCA CAAGCCCCCA GTCCCAGAGG CGTGATTTTC TGGCATCCTT 120 

AAATCTTGTG TCAAGGATTG GTTATAATAT AACCAGAAAC CATGACGGCG GCTGAGAACG 180 

TATGCTACAC GTTAATTAAC GTGCCAATGG ATTCAGAACC ACCATCTGAA ATTAGCTTAA 240 

AAAATGATCT AGAAAAAGGA GATGTAAAGT CAAAGACTGA AGCTTTGAAG AAAGTAATCA 300 

TTATGATTCT GAATGGTGAA AAACTTCCTG GACTTCTGAT GACCATCATT CGTTTTGTGC 360 

TACCTCTTCA GGATCACACT ATCAAGAAAT TACTTCTGGT ATTTTGGGAG ATTGTTCCTA 420 

AAACAACTCC AGATGGGAGA CTTTTACATG AGATGATCCT TGTATGTGAT GCATACAGAA 480 

AGGATCTTCA ACATCCTAAT GAATTTATTC NAAGGATCTA CTCTTCGTTT TCTTTGCAAA 540 

TTGAAANAAA CANAATTGCT AAAACCTTTA ATGCCANCTA TNCCTGCATT TTTGGGA 597 

<210> 14 



<211> 634 
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<212> DNA 

<213> Homo sapiens 

<400> 14 

AGACTCTCAC CGCAGCGGCC AGGAACGCCA GCCGTTCACG CGTTCGGTCC TCCTTGGCTG 60 

ACTCACCGCC CTCGCCGCCG CACCATGGAC GCCCCCAGGC AGGTGGTCAA CTTTGGGCCT 120 

GGTCCCGCCA AGCTGCCGCA CTCAGTGTTG TTAGAGATAC AAAAGGAATT ATTAGACTAC 180 

AAAGGANTTG GCATTAGTGT TCTTGAAATG AGTCACAGGT CATCAGATTT TGCCAAGATT 240 

ATTAACAATA CAGAGAATCT TGTGCGGGAA TTGCTAGCTG TTCCAGACAA CTATAAGGTG 300 

ATTTTTCTGC AAGGAGGTGG GTGCGGCCAG TTCAGTGCTG TCCCCTTAAA CCTCATTGGC 360 

TTGAAAGCAG GAANGTGTGC GGACTATGTG GTGACAGGAG CTTGGTCAGC TAAGGCCGCA 420 

NAANAAGCCA AGAANTTTGG GACTATAAAT ATCGTTCACC CTAAACTTGG GAGTTATACA 480 

AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCAGATG CCTCCTACGT GTATTATTGC 540 

GCNAATGAAA CNGTGCATGG TGTGGANTCT GACTTTATAC CCGATGTCNA GGGAACATAC 600 

TGGTTTGTGA CATGTCCTCA AACTTCCCGT CCNA 634 

<210> 15 

<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 15 

AGTCTGCGGT GGGCTANCGG ACGGTCCGGC TTCCGGCGGC CGTTTCTGTC TCTTGCTGGC 60 

TGTCTCGCTG AATCGCGGCC GCCTTCTCAT CGCTCCTGGA AGGTCCCGAG CGCGACACCA 120 

TGTCGGAACC CGGGGGCGGC GGCGGCGAAG ACNGCTCGGC CGGATTGGAA GTGTCGGCCG 180 
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TGCANAATGT GGCGGACGTG TCGGTGCTGC ANAAGCACCT GCGCAAGCTG GTGCCGCTGC 240 

TGCTGGAGGA CGGCGGCGAA GCGCCGGCCG CGCTGGAGGC GGCGCTGGAG GAGAAGAGCG 300 

CCCTGGAGCA GATGCGCAAG TTCCTTTCGG ACCCGCACGT CCACACGGTG CTGGTGGAGC 360 

GCTCCACGCT CAAAGTGGAC GTCGGTGATG AAGGAGAAGA AGAAAAAGAA TTCATTTCCT 420 

ATAACATCAA CNTAGACATT CACTATGGGG TTAAATCCAA TAGCTTGGCA TTCATTAAAC 480 

GTACTCCCGT GATTGATGCA GATAAACCCG TGTCTTCTCA NCTCCGGGTC CTTACACTCA 540 

GTGAANACTC NCCCTACNAA AACTTTGCAT TCTTTCATTA ACAATGCAGT GGCTCCTTTT 600 

TTTAANTCCT ACATTAAAAA ATCTGGCAAG GCAAACAGGG ATGGTGATAA AATGGCTCCT 660 

TCCNTTGAAA AAAAAATTGC CGAACTCNAA ATNGGACTCC TTCCCTTGCA NCAAAATTTT 720 

TGAAATTCCG GAAAATCANC CTGCCCAATT CCTCCCC 757 



<210> 16 



<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 16 

ATCATTTCCT TATTTATATT TCATGTTGGA ATGCTTAAAT CGATAACCTT TGTATTTTGA 60 

AGTGCGCGAC ATGGAAGGTG ATCTGCAAGA GCTGCATCAG TCAAACACCG GGGGATAAAT 120 

CTGGATTTGG GTTCCGGCGT CAAGGTGAAG ATAATACCTA AAGAGGAACA CTGTAAAATG 180 

CCAGAAGCAG GTGAANAGCA ACCACAAGTT TAAATGAAGA CAAGCTGAAA CAACGCAAGC 240 

TGGTTTTATA TTAGATATTT GACTTAAACT ATCTCAATAA AGTTTTGCAG CTTTCACCAC 300 

<210> 17 
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<211> 313 
<212> DNA 

<213> Homo sapiens 



<400> 17 

AAAGATGGCG GCGGGGGAGG TAGGCAGAGC 
CCTCCGCTCC AGTCGCCTCC GGTCCTTCAA 
CGCCGGGTCC CGCGGGGAAA ATGGTGGAGC 
GTGAGAGTGG AATTAGTCCG AATGACTCTT 
AACTCCAATG CCTACCCCGT CAGTTCAGCA 
TTTGGAGACC GAA 



AGGACGCCGC TGCTGCCGCC GCCACCGCCG 60 
ACTCACACCT CCCGGGAGGA GCTGTCCTGG 120 
CAGGGCAAGA TTTACTGCTT GCTGCTTTGA 180 
TGATATTGAT GGTGGAGAtG CANGGCTTGC 240 
NTCAGTGCCA CTTANTGCAT TANAACTANG 300 

313 



<210> 18 



<211> 667 
<212> DNA 
<213> Homo sapiens 



<400> 18 

ACTGCCGGGC TCGGCGTGAG TCGCTGCGGG GCTGACGGGG TGGCAGTGCG GCGGGT-TACG 60 

GCCTGGTCAG ACCATAATGA CTTCAGCAAA TAAAGCAATC GAATTACAAC TACAAGTGAA 120 

ACAAAATGCA GAAGAATTAC AAGACTTTAT GCGGGATTTA GAAAACTGGG AAAAAGACAT 180 

TAAACAAAAG GATATGGAAC TAAGAAGACA GAATGGTGTT CCTGAAGAGA ATTTACCTCC 240 

TATTCGAAAT GGGAATTTTA GGAAAAAGAA GAAAGGCAAA GCTAAAGAGT CTTCCCCAAA 300 

ACCANAGAGG AAAACACNAA AAACAGGATA AAATCTTATG ATTATGANGC ATGGGCAAAA 360 

CTTGATGTGG ACCGTATCCT TGATGAGCTT GACAAAGACG ATAGTACCCA TGAGTCTCTG 420 
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TCTCAAGAAT CAGAGTCGGA AGAAGATGGG 
TTAAAAGAAA AGGGCNATAA ATACTTCCAC 
ACACNAAAGG CNTGGATGCC GATCCATATN 
CATATTTTAG ACTGAAAAAA TTTGCTGTTG 
TGAAATA 
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ATTCATGTTG ATTCNCNAAA GGCTCTTGTT 480 
AAGGAAAATA TGATGAAGCA ATTGACTGCT 540 
ATCCCGTGTT GCCAACGAAC ANAACNTCCG 600 
CTGAATCTGA TTGTTATTTAN CANTTGCCT 660 

667 
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